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Abstract. It is often assumed that the halo-patch fluctuation field can be written as a 
Taylor series in the initial Lagrangian dark matter density fluctuation field. We show that 
if this Lagrangian bias is local, and the initial conditions are Gaussian, then the two-point 
cross-correlation between halos and mass should be linearly proportional to the mass-mass 
auto-correlation function. This statement is exact and valid on all scales; there are no higher- 
order contributions, e.g., from terms proportional to products or convolutions of two-point 
functions, which one might have thought would appear upon truncating the Taylor series of 
the halo bias function. In addition, the auto-correlation function of locally biased tracers can 
be written as a Taylor series in the auto-correlation function of the mass; there are no terms 
involving, e.g., derivatives or convolutions. Moreover, although the leading order coefficient, 
the linear bias factor of the auto-correlation function is just the square of that for the cross- 
correlation, it is the same as that obtained from expanding the mean number of halos as a 
function of the local density only in the large-scale limit. In principle, these relations allow 
simple tests of whether or not halo bias is indeed local in Lagrangian space. We discuss why 
things are more complicated in practice. We also discuss our results in light of recent work 
on the renormalizability of halo bias, demonstrating that it is better to renormalize than not. 
We use the Lognormal model to illustrate many of our findings. 
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1 Introduction 

Galaxies are biased tracers of the dark matter distribution [1, 2]. In the local bias model, the 
abundance of the biased tracers at a given position is assumed to be related to the mass at the 
same position. The simplest version of this model, in which the smoothed overdensity field 
of the biased tracers 5b is treated as though it were a deterministic function of the (similarly 
smoothed) real-space mass overdensity field S m , has been the subject of much study [3, 4]. 
Following ref. [5], it has become common to write the local model as 

4 = /W=E^( f m-(4,» ) (1-1) 
i>0 

where bi is the bias coefficient of order i. Note that this model ensures (5b) = by subtracting- 
off the (Sin) terms. 

One of the points we make in what follows is that one should, instead, normalize using a 
multiplicative factor which ensures that (1+5&) = 1. I.e., given an expansion of the form (1.1), 
one should always work with 

_ 1 + 5 b - (1 + 5 b ) 

= (l + 5 h ) ■ (L2) 

Although the model was invoked to describe the bias with respect to the late-time 
nonlinear Eulerian field 5^, since it is only invoked on large scales, it is often assumed, and 
sometimes explicitly used, to describe the bias with respect to the initial Lagrangian field 5^ 
(on large scales, these are expected to be similar). However, the two best studied models of 
bias with respect to the Lagrangian field - those associated with peaks, and patches which 
form halos, in a Gaussian random field - behave rather differently from naive expectations 
based on the local bias model. In particular, in the local bias model calculation of the cross- 
correlation between the biased tracers and the initial field, (5f,5 m ), one proceeds by writing 
the bias function as a Taylor series, and then expanding order by order in 5 m . This means 
that one expects higher-order terms to contribute. However, for peaks and in Gaussian initial 
conditions, the exact answer is (5b5 m ) = b\ (5^) [6]. Therefore, in the expansion referred to 
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above, all the higher-order terms must cancel out. We show below that this is also true for 
another popular choice of the bias - a Lognormal mapping between the biased field and 5 m - 
and then that this is generically true in models where the bias is local and deterministic with 
respect to the initial Gaussian random field. I.e., the cross-correlation is always only linearly 
proportional to the auto-correlation signal of the dark matter, although, in general, the linear 
bias factor need not equal b\ of the Taylor series. We then show that the auto-correlation 
function of the biased tracers can always be written as a Taylor series in the auto-correlation 
function of the original (unbiased) mass fluctuation field; derivatives and convolutions do 
not enter. Along the way, our analysis connects to recent work on renormalized bias [7], 
showing, e.g., that this renormalization is required if the Lognormal mapping is to make 
sensible predictions. 

We have structured the discussion as follows. In section 2.1 we discuss the Lognormal 
mapping, since it turns out that all quantities of interest can be computed exactly (no 
truncation of the sums is required). These exact expressions exhibit some curious properties 
which are not obvious if one trucates the sums. Then we show that how one normalizes such 
purely formal expansions plays an important role, and why eq. (1.2) is to be preferred over 
eq. (1.1). Section 2.3 explores a more complex example in which some of this simplicity is lost, 
before showing the general result in section 2.4. Section 2.5 shows how to proceed if the full 
expansion is not available, so one is constrained to work with a truncated series. Section 2.6 
connects this analysis to some of the earliest work on this subject (ref. [3]), discussing how the 
coefficients of the Taylor series expansion of 5 b in terms of 5 m are related to those obtained 
from expanding £bb in terms of £ mm . Section 2.7 discusses halo bias in the context of these 
results, and section 2.8 revisits a technical point about halo bias, first made by ref. [8] but 
often overlooked, which complicates the use of cross-correlations for testing the hypothesis 
that Lagrangian halo bias is indeed local, a subject of much recent interest [9-12]. A final 
section summarizes. 

2 Local Lagrangian bias 

2.1 The Lognormal model and the usual expansion 

We begin with the usual model, eq. (1.1), with 5 m = 5l to emphasize the fact that the mass 
field is for Lagrangian space, and recall that the terms are inserted to guarantee that 
(5b) = 0. As an explicit example, we will consider the Lognormal transformation, which aims 
to map 1 + 5t> — > exp(6(5x,). The parameter b is the one free parameter of this transformation: 
large values of b strongly enhance large values of 5l- 

Since the coefficients in the Taylor series are simply b^ = b k , for a Gaussian distribution 
of 5l, the additive correction terms can be summed explicitly: 

E | <#> = E % <#> = eMb 2 (6 2 L )/2) - 1. (2.1) 

fe>0 ' fc>0 
This means that, in this expansion, the Lognormal model is really 

5 b = exp(65 L ) - 1 - [exp(6 2 (5 2 L )/2) - 1] = exp(6<5 L ) - exp(6 2 (5 2 L )/2). (2.2) 

Notice that, as a result of this additive normalization term, 5b ^ when 5l = 0. 
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The cross-correlation between the biased field and the original one can also be done 
analytically: 

<w = E y <^ +1 > = b ^) -p (^r) (2 - 3) 

fc>0 ' ^ ' 

where the final expression shows the result of evaluating the sum for the Lognormal coeffi- 
cients. Finally, the auto-correlation, which can also be done analytically, yields: 

(8 2 b ) = <[exp(W) - exp(6 2 <5 2 >/2)] 2 > = exp(6 2 <5 2 »[exp(& 2 (<5 2 )) - 1]. (2.4) 

2.2 Additive versus multiplicative normalization of the Lognormal mapping 

The treatment above ensures that (5b) = by making an additive correction. However, 
because this additive correction term has, in effect, shifted the mean value of the transfor- 
mation, one should really account for the fact that the definition of the mean density, with 
respect to which one would like to define the biased fluctuation field, has also been modified. 
This modification corresponds to accounting for the fact that, prior to adding these terms, 
(1 + ^)^1- 

If one enforces (1 + 5^) = 1 by using a multiplicative factor rather than by an additive 
one, then one would write the Lognormal transformation as 

1 + 5 b = exp(M L ) exp(-6 2 (5|)/2). (2.5) 

The first term on the right hand side is the same deterministic transformation of the variable 
as before, and the second, exp(— b 2 (5\ )/2), is the multiplicative factor that is required to 
ensure that (1 + 5b) = 1. Note that this factor depends explicitly on the scale L on which 
the Lognormal transformation occurs, via (5\). On large scales, (5\) <C 1, making the 
overall normalization factor — > 1. In this limit, the Taylor series expansion of eq. (2.5) 
has coefficients bk = b k , but otherwise, these coefficients generally pick up (5 2 L ) dependent 
multiplicative correction factors. 

Integrating over the underlying Gaussian distribution of 5l shows that the cross-correlation 
between the biased field and the original one is 

((1 + 5 b )(l + 5 L )) = 1 + (5 b 5 L ) = 1 + 6 (5 2 L ), (2.6) 

whereas 

<(l + ^) 2 )=exp[6 2 <5 2 )]. (2.7) 

These expressions differ from eqs. (2.3) and (2.4) for (5(,5l) and (5 2 ) by one and two pow- 
ers, respectively, of the multiplicative normalization factor associated with the Lognormal 
transformation (see discussion following eq. (2.5)). Eq. (2.7) is the usual expression for the 
relation between the correlation function of the Lognormal field and that of the underlying 
Gaussian [13], suggesting that the present analysis is correct, and that of the previous sec- 
tion is not. I.e., normalizing 5b by subtracting a constant, rather than normalizing 1 + 
by a multiplicative constant is ill-advised. Moreover, notice that now the cross-correlation 
function is particularly simple: (S^Sl) = b(5\), with no higher-order terms (eq. (2.6)). 

For what follows, it is useful to write out slightly more general expressions for the cross- 
and auto-correlations. The cross-correlation between the Lognormally biased field and the 
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original Gaussian one smoothed on a different scale L' than that on which the bias was 
defined, and separated by a distance r, is 



(5 b 5 L ,\r) = b(5 L 5 v \r) = b£ LI/ (r), (2.8) 

where £z,z/(r) denotes the correlation between the initial field when smoothed on scale L 
and when smoothed on scale V and displaced by r. We note again that this expression is 
exact - it is remarkable that the cross-correlation function is just a linearly biased version 
of that of the underlying field, despite the fact that the transformation itself was highly 
nonlinear. Peaks in Gaussian fields exhibit this same simplicity [6]. Note in particular that 
the linear bias factor is the first term in the Taylor series of 5b in the (5\) — > limit. We 
show shortly that, although {5^5 y\r) oc (^l^l'I 7 *) i s generic, the constant of proportionality 
is not necessarily b\, the first term in the Taylor series of the mapping between 5b and 5l- 

The cross-correlation between two differently biased tracers b and b', defined using 
transformations on scales L and L', and separated by r, can also be computed exactly: 

l + 6b'(r) = ((l + 5b)(l + 5b>)) = exp[bb'Z LL ,(r)}. (2.9) 

This can be expanded as a series in ((Jl^l'I 7 ") to yield: 

£ 66 ,(r) « bb'(5 L 5 L ,\r) + {bb'f ^^ +... . (2.10) 

Therefore, if b = b' and L = V then £bb{r) « 6 2 £ll(0) and 

(5 b 5 L \r) 2 



(5 b 5 b \r) 



i LL {r). (2.11) 



It turns out that this is a generic feature of local Lagrangian bias; it holds even when the 
constant of proportionality between (#6#L|r) and (5\\r) differs from b\ of the Taylor series. 

Before moving on, we note that the ratio p 2 = {5i5i) 2 / {5 2 ){5\) is sometimes used to 
quantify the 'stochasticity' of the bias, where p 2 < 1 is taken to imply stochasticity. For the 
Lognormal mapping above, p 2 = x/(e x — 1) with x = 6 2 (<5| / ). This equals unity only when 
{5\) = 0, despite the fact that the Lognormal mapping is explicitly deterministic. Clearly, 
p 2 is not a good indicator of stochasticity. 

2.3 Approximate transformation for peaks 

Suppose one smoothes a Gaussian random field with a filter of scale L p . When the peak 
point process is smoothed with a filter of scale L > I p , then ref. [2] argue that it is useful 
to think of the smoothed field as defining a peak fluctuation field which is related to that of 
the initial Gaussian field by 

1 + 5 P = e*p(W L - c5 2 /2) yjl + c(5 2 ) exp (- f|^ ) • (2-12) 

(Note that we are normalizing using a multiplicative rather than additive factor.) The free 
parameters b and c are related to the properties of the peak (e.g., its height and curva- 
ture); they determine a rather complex normalization factor that, as before, depends on the 
smoothing scale via {5f) and tends to unity on large scales. In this limit, the first few bias 
coefficients associated with this transformation are b\ = b, b 2 = (b 2 - c), 6 3 = (6 3 - 36c), and 
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&4 = (b — 6b 2 c + 3c 2 ). (The structure of these terms means that b^ is very similar to the 
Hermite polynomial Hk{b); in fact bk = Hk(b) when c = 1. This happens because the bias 
relation has the same form, exp(bt — t 2 /2), as the generating function of the Hermites.) Note 
that the quantity 63 — 61&2 = — 2fec 7^ 0, in contrast to the Lognormal studied earlier. This 
will matter in section 2.5. 

For this relation, the cross-correlation between such peaks and the field at a different 
position when smoothed on a different scale is 

(S P 6L>\r)= 1 + b c{g2 £Lv(r). (2.13) 

As for the Lognormal transformation, this expression is exact, and it is linearly proportional 
to £z,z/( r )> there are no higher order terms. However, in this case, the constant of propor- 
tionality equals the first term in the Taylor series of 6 P only in the {5\) — > limit; in general, 
they are different. 

When the two smoothing scales are the same, then 

1 + ( 6p 5 p \r) = \l-C 2 e!x(r)l ~ 1/2 exp ( J?&lrL ) , (2.14) 



where 



B = --, 7727 and C = -> 7727- ( 2 - 15 ) 

l + c(S 2 L ) l + c(6 2 L ) 



To leading order in £ ££ , this reduces to 

(5 P 5 p \r) « B 2 £ LL (r) + ^ = ° ? Hdr) + " Q ^ ^ £ (r) + . . . (2.16) 



I.e., to leading order this transformation also satisfies eq. (2.11) even though this bias factor 
is not the same as the leading order term in the Taylor expansion. On the other hand, if we 
view this final expression as the Taylor series expansion of the correlation function, then the 
coefficients of this expansion have the same structure as the <tl — > limit of 5 P . We return 
to this shortly. 

2.4 Normalizing density rather than overdensity: The general case 

The previous sections showed that, for Lognormal-like, local, deterministic, Lagrangian bias 
functions, one should normalize the density of the biased field to unity (using a multiplicative 
factor), rather than the overdensity to zero (by subtracting a constant). We now study the 
general case. 

If one starts with an arbitrary bias function of the form given by eq. (1.1), then the 
correctly normalized bias field is simply defined by eq. (1.2): 

_ i+6 b -(i+6 b ) Er=i(WfcQ(*i -<*!)) . ^ 

Note that this renders the coefficient 60 redundant, so we can set it equal to unity (and 
redefine all other coefficients in units of bo). This expression only differs from eq. (1.1) 
because of the term in the denominator. In general, 5b 7^ when 5l = 0. 
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It is a simple matter to check that this works out correctly for a Lognormal. Moreover, 
it is straightforward to see that 



(6 L >5 B \r) 



zr=i(h/k\)(5 L ,6 k L \r) _ £r=i(Wfco<<% +1 )/(<%) 



tLL>{r)= B L £ LL ,(r), (2.18) 



where the second equality used the fact that (Sl>8l\t) = 5l £LL'( r )/(^i)> so {^L'8\\r) is 
just £,LL'{r) (<5l +1 )/(^I)- This shows that (5l'5 b \t) is linearly proportional to £r,i/(r), where 
the final expression defines the constant of proportionality B^. Note that this is an exact 
statement, valid for any r, L or L' , and for any local deterministic bias function. In particular, 
there is no requirement that r be large compared to either L or L', nor that V be larger 
than the scale L on which the local transformation from the dark matter to the biased field is 
(assumed to be) monotonic and deterministic. Furthermore, notice that, although Bl 7^ b\ 
in general, they are indeed equal in the limit (5\) — > 0. (Our eq. (2.18) is consistent with 
eq. (10) of [14], in the Lagrangian bias limit in which all their C pq = 0; but our formulation 
highlights the fact that, in their expansion, all the higher order terms are proportional to 
powers of (5\), whereas none are proportional to powers of £.) 

For similar reasons the auto-correlation function of the biased tracers will reduce to a 
series of the form 



This means that, to lowest order, eq. (2.11) is satisfied even in the general case. Note that 
the auto-correlation function of the biased tracers can always be written as a series in £. 
E.g., terms involving derivatives or convolutions of £ do not appear. Of course, this means 
that the power spectrum of a locally biased tracer will generically involve convolutions of the 
original P(k). In this sense, local bias is simpler in real space than it is in Fourier space. 

2.5 Renormalized bias 

The analysis above highlighted the fact that it was important to include the multiplicative 
normalization factor when enforcing (5 b) = 0. We illustrated this using the Lognormal, for 
which all the sums could be performed analytically. The question now arises as to what to 
do when this cannot be done. 

The analysis above suggests that one should redefine the mean density, and hence all 
bias factors, order by order. This corresponds to truncating eq. (1.2) rather than eq. (1.1): 



(5 B ,5 B \r) = Bl£ LL (r) + -f ^(r)] 2 + . . . 



(2.19) 



{j) = l + ^-(l + ^ = Ej =1 (h/k\)(5l-(5l)) 
+ % EU(b k /k\M) 



(2.20) 



Hence, the cross-correlation between the mass and biased fields is 



EU(h/k\)(5 k L ) 



(2.21) 



To 4th order in 5l, this is 




h (5 L >5 L \r) + (b 3 /3l)(5 L ,5l\r) 
l + (b 2 /2)(5l) + (b 4 /A\)(5i) 




63 _ 6162 
2 2 



<£i>j£zz'(r) = &?Wr): 
(2.22) 
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where the final equality defines • This differs from truncating eq. (f.f) because of the term 
which is proportional to b\ 62. For the Lognormal transformation (b k = b\), 63 = 6162 so the 
term that is higher order in a is zeroed out in this expansion, whereas it would survive if we 
had started from eq. (1.1). Thus, this approach correctly gets (5l5^) = 6£(r). In fact, one 

can show that, for the Lognormal, {^l^b) = b{5\) for any j. In this sense, this treatment 
is a significant improvement on the usual truncation of eq. (1.1). However, since 63 7^ 6162 
in general, this approach would also lead one to conclude wrongly that the cross-correlation 
includes higher order terms when it does not. For example, for the approximate peaks-bias 
relation of eq. (2.12), 63 — 6162 = —26c 7^ 0. On the other hand, note that this expression 
returns b^ = b — 2bc{5 2 L ) /2 = 6(1 — c{5 2 L )) which is indeed the exact answer, 6/(1 + c{5 2 L )), 
expanded to first order in (SV). That is to say, although the renormalization approach 
suggests more complexity than is present in the exact answer, it is at least self-consistent. 
A similar calculation for the auto-correlation yields 

<«V> - $(r) = (b { ??tLL{r) + b ^LL(rf. (2.23) 

Note that although truncating eq. (1.1) has the same form, in this case, the coefficients are 
correct for a Lognormal, illustrating again that this is a better approach. 

We are not the first to advocate normalizing by a multiplicative factor. Ref. [7] noted 
that this was advisable, especially in the context of truncated expansions. However, that anal- 
ysis did not highlight the fact that this makes the Lagrangian bias so simple (our eq. (2.18)). 
Rather, ref. [7] went on to consider the implications in Fourier space, arguing that the k = 
limit of the term which scales as b 2 /2 should be removed from the definition of the bias, and 
instead absorbed into a shot-noise like term. This procedure has the virtue of making the bias 
defined by the auto-correlation equal the square of that defined by the cross-correlation by 
definition even in Fourier space, which is the scaling satisfied by the exact answer. Ref. [15] 
argue that this may not be the best way to think about b 2 /2 (or the higher order bias 
coefficients). 

2.6 Relation to Szalay (1988) 

In all essential respects, the analysis above is simply a restatement of results in [3]. For 
y = 5L/{Sf j ) 1 ^ 2 , Szalay assumed that the bias function G(y) could have non-negative values 
only, and that 

(G(y)) = J d yG(y)^- = l. (2.24) 

Because G > 0, and (G) = 1, his G is, in effect, our 1 + 8b of eq. (2.17). He then expanded 
G in terms of Hermite polynomials: 

OO „ „ _„2 



Bi. r e~ y I 2 

G ^ = E TT H ^ where B k= dy — =- G{y) H k (y) = {G(y) H k {y)) (2.25) 



and 



H k (y) = e^/ 2 ^-±y e -y 2 / 2 , with (H m (y)H n (y)} = 8 nr 



(2.26) 
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The orthogonality of the Hermite polynomials allowed him to show that 

1/2/9 f _( , «„_,/i,„,/,12/9Y 1 _,,,2 N OO „ OO 



r e -2/?/2 r -(y 2 -w 12yi ) 2 /2(l-wj 2 ) °° d °° R 



'12^ fc=0 ' i=0 



E 



'* ^12 , (2-27) 



fc! 

k=0 

where w±2 = {yiU2\ r ) = CLL( r )/°"L- Comparison with eq. (2.19) shows that his Bj~ are our 
Bl-, Cl, etc., so they are complicated sums over the (renormalized) bias factors. This means 
they are, in general, cjl dependent combinations of the coefficients of the Taylor series 
expansion of 5h on scale L. 

Szalay did not compute the cross-correlation function, but it is straightforward to see 

that 

'mGiVi)) = I dyi 6 r— w 12 yi^2^-H k ( yi ) = w 12 ^■H h {y l )H 1 {y 1 )\ 
J ^ k=o ' \k=o ' I 

w^iBxH^yt) 2 ) =B lWl2 , (2.28) 



where we have used the fact that H±(y) = y and the orthogonality of the Hermite polynomi- 
als to simplify the expressions. Notice that the tracer-mass cross-correlation is indeed just 
linearly proportional to the auto-correlation of the mass, and the square of it is the leading 
order term of the auto-correlation function, consistent with eq. (2.11). 

We remarked earlier that the B k values depend on the scale on which the transformation 
is applied. It is a simple matter to check that setting B k = (b<JL) k yields the correctly 
normalized Lognormal mapping considered earlier (eq. (2.5)). Notice that in this case B k 
is a separable function of a scale dependent piece a\, and a constant piece whose value is 
given by the large scale ctl — > limit of the Taylor series expansion of Sh- That is to say, 
if one expands the (Lagrangian space) halo auto-correlation function in powers of the mass 
correlation then the coefficient of the kth order term in the expansion is simply B?, and 
this term is independent scale L, even though the bias coefficients in the Taylor expansion 
of the field 1 + 5h(5i) itself do depend L. Therefore, the bias parameters estimated from a 
scatter plot of l+5h versus 5l 'run' with scale L, whereas those estimated from the correlation 
function do not. 

We will return to this in the next section, but note that it is not generic. E.g., for the 
peaks transformation of eq. (2.12), B x = Ba L , B 2 = (B 2 - C) cr|, B 3 = (B 3 - 3BC) a\ etc., 
where B and C were defined in eq. (2.15). I.e., the B^ satisfy the same relations between 
the rescaled values B and C that the b k do for b and c, but B and C differ from the peak- 
background split values (i.e. the large scale Taylor series coefficients of 5h) b and c by a factor 
of 1 + ca\. 

2.7 Relation to halo bias from the excursion set approach 

The analysis above made the point that the leading order bias factor for the cross-correlation 
is the leading order term in the Taylor series expansion of 8^ only in the (large-scale) <tl <C 1 
limit. So it is somewhat surprising that the differences between these two are sufficiently 
small as to have not attracted significant attention. In part, this is because the large scale 
halo bias factors, as determined from the excursion set approach, satisfy 

b k = v k - 1 H k+1 (y)/6*, where v = 8 c /a h (2.29) 
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[16]. Here cj/j is related to the smoothing scale which contains the halo mass (i.e. 
p(47r/3)i?^ = Mh), and 5 C is the overdensity associated with halo formation. Since these 
bias factors were determined from a physically motivated model, rather than an arbitrary 
formal expansion, they are already correctly normalized, in the sense that (1 + 5h) = 1- This 
makes the renormalized 5b of eq. (2.17) equal to the original 5^, so the renormalized large 
scale bias factors are simply those in eq. (2.29). 

What is remarkable about the excursion set approach is that, for any smoothing scale 
(larger than that on which the halos were defined) the associated B k satisfy 

B k = crl b k (2.30) 

[6, 12]. I.e., just as for the Lognormal mapping (for which b k = b k ), the B k are separable 
functions of the scale independent piece b k and the scale dependent piece <tl. Note that 
this separability is not general; e.g., it does not apply for the mapping in eq. (2.12). When 
k = 1, this separability implies that (ShSh\r) = a^Biw(r) = b\a1w{r) = &i£(r). I.e., for 
excursion set halos, the cross-correlation measurement returns the large scale linear bias 
factor b\ whatever the smoothing scale L, and whatever the separation r. 

Inserting eq. (2.30) in eq. (2.27) indicates that if one expanded the halo auto-correlation 
function in powers of the mass correlation, then the coefficient of the /cth order term in the 
expansion is simply b|. Note again that this statement is not restricted to large scales. This 
simple prediction for halos in Lagrangian space has not been noticed before. 

In addition, the higher order correlations of the halos are given by Szalay's eq. (11). 
Like the two-point function, these can be written as sums of products of £. To order £ 3 , the 
three point function is given by 



Cl23 = b\b 2 (^12^23 + £23£l3 + 6.263) + b\ ^12^23^ 



13 



+&1&2&3 



c i £23 + £13 2 £12 + 63 t 2 62 + £23 

S12 7. r ?23 o 1 ?13 - 



(2.31) 



where &j denotes the two-point correlation function of the mass, smoothed on scale L at 
separation r^. For equilateral triangles, this simplifies to 

C eq (r) = 2>b\b 2 e LL (r) + blf LL {r) + 3 hb 2 b 3 (2-32) 

2.8 On the appropriate ensemble over which to average 

The analysis above, like essentially all previous analyses in the literature to date, makes 
a technical assumption about how to compute the ensemble averages in Lagrangian space. 
Namely, it assumes that, for Gaussian initial conditions, this average is over a Gaussian 
probability distribution function. However, although averaging over the initial (Gaussian) 
pdf is technically correct for peaks, it is known to be incorrect for patches which are destined 
to form halos [8]. This is because, in the excursion set definition of halos, 5l is required to 
be less than 5 C on all scales L larger than that on which the halo was defined: this constraint 
on all larger scales is, in effect, a nonlocal requirement. In practice, this means that the use 
of a Gaussian distribution for 5l is, formally, inappropriate. As shown in ref. [8] (see their 
eq. (17)), a more careful treatment of this averaging, with the appropriate replacement of 
the Gaussian pdf yields 



C — 2 

erf 



^2(^) + (^i 2 o + l)erfc(i/io/v / 2)-^2i/ 1 2 /vr e -^o/ 2 , where vf Q = v 2 {a 2 h /a 2 L -l). 

(2.33) 
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When (jl <C cr h , this expression asymptotes to £ x — > b\o\ (recall that b\ = Hi(v)/5 C ), but 
in general the simple constant linear bias of the cross-correlation function between halos and 
mass is spoilt. E.g., as <tl — > ah, the cross-correlation £ x tends to S c . 

It is a simple matter to extend this analysis to see how averages over the higher order 
Hermite polynomials associated with the higher order bias factors are modified, but this is 
beyond the scope of the present work. Measurements in simulations are needed to see if the 
range of scales over which the simple Gaussian averaged estimate is accurate - and hence all 
the power of the Hermite polynomials - is large enough to be interesting. 

3 Discussion 

In local bias models it is assumed that the biased field 1 + Sf, can be written as a function of 
the underlying density field at the same position. We showed that, if the underlying field is 
Gaussian, then the cross-correlation between the biased field and the original one is linearly 
proportional to the auto-correlation function of the original field. This is an exact result, 
valid on all scales, and is not a consequence of truncating expansions, etc. While this is 
implicit in some previous work (e.g. ref. [14]), it has not been highlighted before. 

If one has been careful to ensure that (1 + <$&) = 1 (by using eq. (1.2) rather than 
eq. (1.1)), then the constant of proportionality is easily related to the first coefficient of the 
Taylor series expansion of the biased field, although they are not equal in general (eq. (2.18)). 
In addition, we showed that, to leading order, the ratio of the square of the cross-correlation 
to the auto-correlation of the bias tracers equals the correlation function of the underlying 
field (eqs. (2.19) and (2.11)). 

We also explored the consequences of truncating these expansions, demonstrating that it 
is better to renormalize all truncated expansions (eq. (2.20)) than to not. In this respect, our 
results agree with ref. [7]. Indeed, although our work has concentrated on Lagrangian bias, the 
multiplicative normalization (our eq. (1.2)) also holds also for Eulerian mass field, although 
providing an explicit expression when the mass field is not gaussian is more complicated (see 
ref. [7] for the implications in Fourier space). 

Expanding the biased field using Hermite polynomials rather than powers of the over- 
density, as was done by ref. [3], provided a easy way to see a number of our results for local 
bias in the initial, Lagrangian, Gaussian field. Although the coefficients of this expansion 
are functions of the scale on which the bias transformation is applied (eq. (2.25)), for the 
Lognormal distribution, as well as for excursion set halos, the scale dependence of these coeffi- 
cients is trivial: Bk = b^a^ (eq. (2.30)), where bk are the scale-independent peak-background 
split bias factors. (The Lognormal has bk = b k , whereas excursion set halos have bk given by 
eq. (2.29).) Therefore, if one expands the (Lagrangian space) halo auto-correlation function 
in powers of the mass correlation, then the coefficient of the fcth order term in the expansion 
is simply b\ (eq. (2.27)). These coefficients for the expansion of 1 + t^hh m powers of £ll are 
independent of scale L, even though the bias coefficients in the Taylor expansion of the field 
1 + 5ft,((5x,) itself do 'run' with L. This property of halo bias in Lagrangian space has not been 
emphasized before. 

The 'running' of the 1 + Sh(5i) bias factors is most easily understood by noting that the 
Hermite polynomials arise naturally if one phrases the question of local bias in Fourier rather 
than real space [17], a connection made recently in ref. [12, in particular, their Appendix B]. 
Moreover, ref. [17] notes that it is better to work with 'renormalized' parameters c n rather 
than the original bias parameters b n (but see [12] for how this renormalization should actually 
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be done). In effect, this corresponds to working with our eq. (1.2) rather than eq. (1.1); our 
analysis shows why this is necessary. This connection to Fourier space bias is rich, because 
we showed that the simple relation B^ = b^a^ is not generic. For the peaks transformation 
of eq. (2.12), both bk and B\. 'run' with L. Nevertheless, the Bk satisfy the same relations 
between the rescaled values that the original bk (in the peak background split limit) do. 
Exploring whether this is generic is the subject of ongoing work. 

Unfortunately, although the analysis based on Hermite polynomials is formally correct 
(and elegant), there are two reasons why, at least for describing halos, it cannot be valid 
for all smoothing scales L and separations r. First, halos do not overlap; this makes £hh of 
the halo point process tend to —1 on scales smaller than ~ [18]. As a result, this halo 
exclusion limits the range in r over which the local model can be applied. The second is 
that, at least in the excursion set definition of halos, there is a nonlocal requirement on the 
density field: this was the subject of section 2.8, which argued that this modifies the pdf over 
which one should compute ensemble averages. Accounting for this makes the ratio of the 
halo-mass cross-correlation bias scale dependent (our eq. (2.33), which is eq. (17) of [8]). The 
small scale limiting value of this modified expression yields S c , which is the correct 'one-halo' 
contribution to the Lagrangian space cross-correlation (the Lagrangian overdensity within a 
region which is destined to form a halo equals 5 C by definition), a fact which has not been 
emphasized before. This suggests that averaging over the more appropriate pdf not only 
leads to sensible results, but accounts for halo exclusion as well, so it may be worth exploring 
further. This may be particularly interesting because, in the large separation limit, it leads 
to £ hh ~ b x < (eq. (27) of [8]). 

There is a third reason why, at least for describing halos, the local bias model is unlikely 
to be valid for all smoothing scales L and separations r. This is related to the fact that the 
tidal field influences halo formation [19]. The correlation with the tidal field leads, generically, 
to nonlocal bias even in Lagrangian space. This is explored further in ref. [20], where the 
nonlocal bias terms are shown to matter most for massive halos, though the nonlocal effects 
are subdominant on large scales. 

If the gravitationally evolved nonlinear Eulerian mass field was a locally biased version of 
the initial field (it is not), then expanding in Hermite polynomials would also be the prefered 
way of describing halo bias (assuming the initial conditions were Gaussian). This is because 
local Lagrangian bias, with local nonlinear evolution, leads to local bias with respect to the 
Eulerian field. Conversely local bias with respect to the Eulerian field could be mapped back 
to local Lagrangian bias (with different bias factors, determined by the local linear-nonlinear 
mapping) . Some of the scalings which are characteristic of local Lagrangian bias will survive 
in Eulerian bias as well. For example, the Lognormal transformation has one free parameter 
b. If we use b' to model the nonlinear mass field, and another b to model the biased tracers 
(incidentally, this means that one should think of the biased field l+<$& as being the nonlinear 
field l+5y raised to the b/b' power), then we may interpret our eq. (2.9) as the Eulerian space 
cross-correlation between the biased field and the nonlinear mass field. Since (S^S^r) and 
(SbiSL\r) equal b^LL(r) and 6'^ll(^) for all r, one might naively have thought that (<5jA'l r ) 
would also be linearly proportional to ^ll(t). Not only is this not true, eq. (2.9) shows that 
it is not proportional to £&'&'( r ) either. In particular, eq. (2.9) shows that in this model of 
local Eulerian bias, the cross-correlation is linearly proportional to the auto-correlation of 
the mass only when £ll *C 1 (i.e., for sufficiently large r). 

Recent work has emphasized the fact that because nonlinear evolution is nonlocal, local 
Lagrangian bias will lead to nonlocal Eulerian bias, and vice versa [10, 11, 17, 21]. The Her- 
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mite polynomials are the orthogonal polynomials associated with a Gaussian field. Therefore, 
the usefulness of Szalay's work for local Lagrangian bias suggests that, if bias is local with 
respect to the nonlinear, non-Gaussian field, then it would be natural to write Eulerian bias 
using the orthogonal polynomials of this non-Gaussian field. This is the subject of work in 
progress. 

Finally, we note that our results depend only on the assumption that the Lagrangian 
matter field is Gaussian, so one might have thought that they also hold for modified gravity 
models. However, for such models, ^-dependence of the linear growth factor is generic, 
with the consequence that the assumption that Lagrangian halo bias is local is no longer so 
attractive [22]. 
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